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1. Introduction 

After many years since their introduction, Cayley Trees (CT) [1] and Bethe Lattices 
(BL) [2] still play an important role as prototypes of graphs. In fact, even if one can 
say, from the point of graph theory, that these ideal graphs [J] are by now obsolete 
objects which have been replaced by more realistic random graphs, like the classical 
random graph since a few decades [8 J [§] and by complex networks more recently [4], 
their key feature, that is the fact they are exact tree, i.e., cycles- free, makes CT and 
BL very instructive examples where exact calculations can be done [S]. However, even 
though there are several excellent works on their applications, there is still a quite 
widespread confusion about their exact definition and use. In particular, by a rapid 
survey (September 2011), ranging from Wikipedia to famous textbooks in statistical 
mechanics, as well as many papers in referred journals of mathematics or physics, one 
finds statements claiming, for example, that the BL is the thermodynamic limit of the 
CT, or one finds that a BL is the interior of a large CT, that can be then analyzed 
by introducing large but finite subtrees, etc... Such false statements and misuses are 
not just formal mistakes, but serious conceptual misunderstandings that may lead in 
turn to fatal errors. The difference between a CT and a BL was emerged long ago in 
[3 [10] but, nevertheless, confusion on the subject remained around over the 
years, both among mathematicians and physicists. We think that the main reason for 
that is due to an ill mathematical approach to the BL, and to a scarce communication 
between the physics and mathematics communities. Moreover, at the time of the 
Refs. [3 HJ EJ], a proper nomenclature for the two kind of graphs was not yet 
consolidated, causing further confusion. 

The aim of this paper is to give a concise definition of CT and BL and to illustrate 
ambiguities-free mathematical tools to be used for statistical mechanical models built 
over CT and BL. We will show two rigorous techniques to deal with BL: self-similarity 
and the Kolmogorov consistency theorem, pointing out that the latter is equivalent to 
the Cavity and Belief Propagation methods, more known among physicists. 

2. Cayley Trees and Bethe Lattices: definition and basic properties 

Both CT and BL are simple connected undirected graphs G = (V, E) (V set of vertices, 
E set of edges) with no cycles (a cycle is a closed path of different edges), i.e., they 
are trees. 

A CT of order q with n shells is defined in the following way. Given a root vertex 
(0), we link (0) with q new vertices by means of q edges. This first set of q vertices 
constitutes the shell n = 1 of the CT. Then, to build the shell I > 2, each vertex of 
the shell / — 1 is linked to q — 1 new vertices. Note that the vertices in the last shell 
n have degree q n = 1, while all the other vertices have degree qi = q, I = 0, . . . , n — 1 
(the shell I = is represented by the single root vertex (0)). 

The BL of degree q is instead defined as a tree in which any vertex has degree q, 
so that there is no boundary and no central vertex and, as a consequence, the main 

J We will see soon that, both CT's and BL's have little numerical availability: on one hand a CT is 
very sensitive to the boundary conditions while, on the other hand, a BL, due to the fact that is an 
infinite graph, cannot be simulated (represented) on a PC. 

§ Note that nowadays the name Bethe Lattice is often used to indicate the regular random graph, 
which is a tree-like graph, but not an exact tree (a tree-like graph is a graph containing only long 
cycles so that locally it looks like a tree [4])- However, in this paper we will reserve the name Bethe 
Lattice to indicate an exact (and then infinite) tree. 



Cayley Trees and Bethe Lattices 



3 



difference between CT and BL is simply that CT is finite while BL is infinite: 

CT: \V\,\E\<oo, (1) 
BL: \V\,\E\=oo. (2) 
Note that Eqs. and ^ imply an important difference for the average connectivity 
c (the connectivity of a given vertex is defined as the number of edges emanating from 
it) between a CT and a BL. In fact, for any finite tree, and in particular a CT, it holds 
|V| = \E\ + 1. Therefore 

CT: c = 2-A (3) 

BL : c = q, (4) 
where in deriving c for the CT case we have used c = 2|J5|/|V| (valid for any graph). 
In Sees. Ill and IV we will see that the difference between Eqs. ([3| and Q has a 
dramatic consequence in statistical mechanics. 

In probability theory, an important distinction is in order between a sequence 
of probability spaces of increasing size that eventually diverges, and a probability 
space that is infinite by definition. Similarly, in statistical mechanics, one can be 
more interested in studying the thermodynamic limit of the density free energy of 
the system, or else in studying the physical properties of a system which is defined 
from the very beginning as an infinite space (physical or abstract). Despite only the 
former kind of infinity leads to a constructive theory of statistical mechanics and seems 
physically relevant (in the real world nothing is really infinite, neither the universe), 
the second kind of infinity may still be, not only mathematically convenient, but even 
physically important (we will see this soon). From Eqs. and ([2]), we see that, when 
we study a model of statistical mechanics built on a CT, we can have access only to 
the thermodynamic limit of the system, while when we study a model on a BL, by 
definition, we have access only to the physical properties of the model as defined over 
an infinite space. Even tough most of the models in physics show equivalence between 
the two different "kind of infinity" , in the case of CT and BL such equivalence is 
lost. The reason for such a difference is easily seen even without entering into the 
details of a specific model. In fact, when we study a model on the CT we need to 
specify the boundary conditions of the model, while on the BL, by definition, there 
are no boundary conditions. Given a CT of degree q > 2 with n shells, the number 
of vertices on the l-th shell is given by Ni — q(q — l) i_1 , while the total number of 
vertices of the CT is N = q((q — l) n — l)/(q — 2). We see therefore that the ratio 
N n /N of the CT, in the thermodynamic limit, N — > oo, does not approach zero (for 
q > 2). This situation is very different from what happens in a d-dimensional regular 
lattice box, where the ratio of the number of boundary vertices (vertices on the d — 1 
dimensional surface), with respect to the total number of vertices, for N — > oo reaches 
zero as fast as l/N 1 ^. Therefore, while the thermodynamic limit of a model built on 
increasing boxes subsets of Z d , is equivalent to the physical properties of the model 
defined on the infinite space Z d , the boundary conditions becoming negligible, the 
thermodynamic limit of a model built on increasing CT's is not equivalent to the 
physical properties of the model defined on the infinite space BL where, by definition, 
there are no boundaries. No matter how large N is, the model built on the CT will 
heavily depend on the boundary conditions, a feature that makes the model on the 
CT rarely representative of a real world physical system, so that the model on the BL 
is often preferred. In the next Sections the non equivalence between CT and BL will 
be made concrete with the example of the Ising model. 
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3. Ising model on Cayley Trees and on General Trees 

Given a CT (V, E) of degree q > 2 and n shells, we want to analyze the Ising model 
built on it, having Hamiltonian 

H [{<*}] d = - J - H °H ^ ( 5 ) 

(ij)eE i 

and partition function 

Zr /gJ2e-? H ^\ (6) 
Wi} 

where j3 = l/T is the inverse temperature, J the coupling constant, Hq the external 
field, the er's= ±ls are the spin variables, and Z; stands for the partition function of a 
CT with I shells. Let us consider free boundary conditions, and, for simplicity, the case 
Hq = 0. Since the CT is finite, we can in particular start to perform the summation 
of © by summing over the boundary spins, i.e., by summing over the spins on the 
n-th shell. By using cosh(/3Jer) = cosh(/3J) we get the recursive Equation 

Z n = Z n _ t [coeliGS J)]*" , (7) 
from which, by iterating, we arrive at 

Z n = [cosh(f3J)] N " +N "- 1+ - +Nl , (8) 

that is 

Z n = [cosh(/3J)] lv] . (9) 
From Eq. Q we see that the density free energy / 

-?f d = f lim l^l = l og [cosh(^J)] (10) 
|y|^oo \v\ 

is an analytic function of (3 for any j3 < oo, therefore the Ising model on the CT does 



not give rise to a spontaneous magnetization. Eq. ( 10 ) has actually a more general 
counterpart, as it holds for any tree, i.e., with no cycles. This can be easily seen, for 
example, by using the high temperature expansion of the free energy |11) . By using 
the high temperature expansion, for any graph G = (V;E), f can be written as 

-/?/= lim ^log[cosh(/3J)]+^ G , (11) 

|V|->oo | V | 

where (fQ is the non trivial part of the expansion and is defined as the sum over all 
the closed non overlapping paths C of G of tanh(/3J)^ c ', where 1(C) is the length of 
the closed path C. Now, if G is a tree, there are no closed paths, therefore cpa = 0, 
and from Eq. ([lj it follows 

-/?/ = log[cosh(/?J)] lim (12) 

The last factor |S|/| V| is nothing else than c/2, c being the average connectivity of G. 
By using Eq. ^ we see that the free energy of any tree does not depend on the details 
of the graph. In conclusion, the thermodynamic limit of the Ising model built on trees 
does not have a spontaneous magnetization. However, it can be shown |H1 [TO] that 
the free energy density / of a CT with q > 2 is a non analytic function of the external 
field h for T < Tbl, where Tbl will be introduced in the next Section. 
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Figure 1. Example of (a portion of) an irregular Bethe Lattice G of degree 
q = 3 with the root vertex (0) having only 2 first neighbors. The numbers (1) 
and (2) in the Figure represent the vertices in the shell n = 1, the numbers 
(1,1), (1,2), (2,1), and (2,2), the vertices on the second shell, and so on. Up 
to a change of names of the vertices, one has G = G\, where G\ is the (infinite) 
subgraph in the shaded area. 



4. Ising model on the Bethe Lattice 

We now illustrate how to solve an Ising model on the BL by using two methods which 
are free of ambiguities. 

4-1- Self-Similarity method 

Let us consider for simplicity a BL lattice G with degree q — 3 with the exception of 
one single vertex (0) that instead has degree 2 (see Fig. 1). We introduce this little 
irregularity on our BL just to make things simpler (later on we will show how to restore 
the fully regular BL). From the root (0) emanate 2 edges pointing at the vertices 1 
and 2. Since G is infinite, it turns out that the two equivalent infinite subgraphs G\ 
and G2 that we obtain by eliminating from G the vertex (0) together with its two 
edges, are also both equivalent to the original G, i.e., up to a change of name of the 
vertices, we have the self-similarity G — G\ = (?2- Note, furthermore, that G\ and 
G2 are each other disconnected. If we define the conditional partition function of the 
system Z(a ) with respect to the root (0) as 

Z(a ) d ^ ]T e-^K-'}], (13) 

{<x;,i^0} 

where H is defined similarly to Eq. ([5]), by using the self-similarity and the fact that 
Gi and G2 are disconnected, we get (again for simplicity we consider here only the 
case with no external field H = 0) 

Z(a ) = e 0Jao(<Tl+a ^Z(a 1 )Z((T 2 ). (14) 



One can feel unease with Eqs. (13) and (14) due to the fact that G is infinite, as well 



as H, and the Z's. However we can get rid of the Z's and any ill defined quantity 
when we consider the finite ratio P(— )/P(+) = Z(—)/Z(+), where -P(co) stands for 
the probability that the spin at (0) has value o~q. If we define 

-2h def P{~) n ,x 
e - p{+y (15) 
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from Eqs. (14) we arrive at the equation for h 



2/3.7-4/1 , „-2/3J , o p -2h 

e - 2h = - — — (16) 

e 2/3J _|_ e -20J-4h _|_ 2e~ 2h ' 

Eq. ( 16 1 has always the trivial solution h = corresponding to a zero magnetization. 



However, by expanding Eq. ( 16 1 for small h, it is easy to see that a non trivial solution 
h =/= is present when T < Tbl , where Tbl is the critical temperature given as solution 
of the equation 

(q- l)tanh(/3J) = 1, (17) 

which has solution as soon as q > 2. In other words, the value q — 2 represents the 
critical number of neighbors above which in the system there exists a phase transition. 
We can now physically understand why, unlike the BL, in the CT we cannot never 
have a spontaneous magnetization. From Eqs. ([3| and Q we see that, unlike the BL, 
on average, no matter how large the degree q of the CT is, the connectivity c of the CT 
is always strictly below the value c = 2, therefore, in the CT, no matter how large its 
degree q is, on average any spin is surrounded by an insufficient number of neighbors so 
that the system cannot have a spontaneous magnetization (at zero external field H) [| 
Similar conclusions apply to the Potts model [10J. The average magnetization of the 



spin do can be calculated from Eq. (15 1 and gives (ctq) = tanh(/i), while, the average 
magnetization for the fully regular BL with q = 3 is simply (a ) = (a) — tanh(/jX 3/2). 
Notice however the necessity to consider the irregular BL: self-similarity, and then the 
possibility to have an equation for the effective field h, applies only to the irregular 
BL, not to the fully regular BL. The physicist familiar with the Cavity Method (see 
next subsection) recognizes in this step another way to see that h is the strength felt 
by a spin due to all the other spins in the absence ( "the cavity" ) of the spin itself. 

We end this subsection with the following critical note. In many textbooks of 
statistical mechanics, as well as in many papers that deal with a BL (regular or not), 



in the place of Eqs. (14) or (U6|, we find respectively 



Wo) = E ^ Ja ^ + ^Z n {a x )Z n {a 2 ), (18) 

(Tl,(T2 



and 



£ 2f3 J-4h„ + e -2f3J + 2e -2h„ 
~~ e 2/3J _|_ e -2pj-4h n ZjZ 2 e -2h n ' ' ^> 

and it is often wrongly said that Z n stands for the partition function of that finite 
portion of the BL having only n shells. From what we have seen all above, it should 
be clear that such an approach for the BL is conceptually quite wrong: Z n cannot be 
interpreted as the partition function of a finite subgraph of the BL, otherwise for Z n 
we would get again Eq. Q. At this level, the only correct meaning we can attribute 
here to Z n (a) is that of the conditional partition function of an infinite subgraph 
self-similar to that associated to Z n +±(a), as showed above, and, most importantly, 
Z n+ i(a) and Z n (a) are the same thing. In this context, the index n appearing in 



Eqs. (|18j or ( 19 ) is a quite misleading symbol. More precisely, one should use instead 



Z(a n +i) and Z(a n ) but, as a consequence, for the ratios we would get the same value 



regardless of the index: h n = h n+ i. Rather, looking at Eqs. (14) or (16), as a mere 



numerical tool, it can be convenient to introduce the recurrent Eqs. ( 18 1 or ( 19 1 since 



|| More precisely, since in the thermodynamic limit for the CT it holds c = 2, from Eq. l |l7| applied 
with q = c = 2, we see that the Ising model on the CT is critical only at T = 0. 
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its fixed points are equivalent to Eqs. (14) or (16), respectively. A more rigorous and 



general meaning to h n can be attributed by following the next measure-theoretical 
approach. 

4-2. Kolmogorov' s condition 

A more rigorous approach to the BL comes from the Kolmogorov consistency theorem 
(see e.g., |12j and [13] ) by which we can avoid to introduce any ill defined quantity 
but, most importantly, as we will see, the method attributes a rigorous and physical 



meaning to Eq. ( 19 ) and to its generalizations. 

From the point of view of probability theory, solving the Ising model on the BL 
amounts to find all the marginal probabilities of an infinite probability space, i.e., a 
probability space characterized by an infinite number of random variables (in our case 
the spins with indices in V), a suitable sigma-algebra |12j . and a measure fi. The 
marginal probabilities can be calculated in turn from an infinite set of given finite 
dimensional marginal distributions \i n = /Xiy- n , where V n is the set of vertices that are 
at distance not greater than n from 0. Note however that, when we consider fj, n , we 
are not dealing with a finite disconnected subgraph of the given BL; the spins of V n 
that are located on the shell n are in fact in turn connected to the spins of V n +\. 
The key point to be used is that, given the values of the spins on V \V n , the spins 
on V n are conditioned only by the neighboring spins, i.e., the spins of the n + 1-th 
shell. In turn, given a spin <7j, with i £ V n , the effects of all its neighbor spins that 
are on V\V n , can be encoded via a single effective external field hi acting only on cr, . 
Therefore, for the /i„, we look for solutions of the form 

* G V n }) = e -^»+£« 6W r B h ^/Z n (20) 

where H n is given by Eq. W n — V n \ V n -x stands for the set of vertices of the shell 



n, and Z n is the normalization constant. Physically, Eq. (20) represents the first step 
of the Bethe-Peierls approach [5] (nowadays also known as Cavity-Method in physics 
[IB], or Belief-Propagation in computer science (18 ) in which the effects of all the 
spins other than those of the set V n are encoded in the fields {hi} to be determined 
self-consistently. From a strict probabilistic point of view, Eqs. pO] ) and ^ represent 
the definition of the Ising model itself on the BL. Now, as we have mentioned in Sec. 
II, the distinction between a sequence of probability spaces of increasing size and an 
infinite probability space is important. The sequence of probability spaces may or 
may not converge to anything, and even if it converges, it may happen that it does 
not converge to a probability space. The tool to investigate whether a sequence of 
increasing probability spaces converges to an infinite probability space is provided 
by the Kolmogorov consistency theorem [T2]. It should be clear that on an infinite 
space like the BL, the probability that a given configuration of spins \o~i,i £ N} (a 
point-like event) is realized is zero: (i({o~i,i £ N}) = 0. Therefore, when one considers 
an infinite probability space, the introduction of a proper sigma-algebra is not just a 
formal necessity. However, since here we do not need to work directly with the measure 
[i of the infinite space, for what follows, we can skip the introduction of the sigma- 
algebras. The Kolmogorov consistency theorem says that a sequence of probability 
spaces (V n , ji n ) converges to the probability space on BL, (V,/i), if for any n > 1 one 
has 

^ fi n +i(Wui G V n+ i}) = Hn{{o-i,i £ V n }). (21) 
{<Tt,iew n+ i} 
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Eqs. (20) and (21) give the functional equations for the fields hi,i € W n . 

Due to the tree-like structure of the BL, Eq. (21) factorizes over independent 
branches allowing for a fundamental simplification. Let us apply Eqs. (20) and (21) 
to the same model analyzed in the previous subsection (a BL with q = 3 in which 
the root vertex (0) has only two first neighbors) [HI [TS] . For any given i G W n we 
indicate with i\ and «2 the indices of the first two neighbors of io that belong to W n +i- 
We have (H = 0) 



-!3H n {{ai,l&V n }) 



3 -/3ff„({<r ; ,;6V„}) 



n 
n 



(22) 



From Eq. (22 ), for any io € W n and for any values of o"j , we have an equation for the 
field hi, 



Ho as a function of the fields h it and h i2 



E 



a PJcT io (<Ji 1 +cr i2 J+Zljj CTij^+Zlj 



Zn+1 „ 

<ri lt <T i3 

from which, by evaluating the cases Oi Q = +1 and Ui = — 1 we get 

e 2/3J-h il -hi 2 e -2l3J+h il +hi 2 , gfe 4l - 



(23) 



It is immediate to verify that Eqs. (23) reduce to Eq. (19) for the choice /i; = /i^ = 
— hn- Besides to be rigorous, this approach, based on the Kolmogorov consistency 
theorem, shows us that a possible non homogeneous solution for the effective fields 
{hi}, periodic or not, is not an artifact of the theory: an effective field which depends 
on the index vertex is in correspondence with a non homogeneous marginal probability 
which depends on the index vertex too. 

In the context of the Cavity and Belief Propagation methods, Eqs. ( 23 ) are better 
known in another form more suited for an intuitive interpretation of the effective fields 
as messages h^j passing from a vertex i to a neighbor vertex j; see Fig. (2). By 
using tanh _1 (x) = 1/2 log[(l + x)/(l — x)], valid for |at| < 1, it is easy to see that Eqs. 
( 23 1 can be rewritten as 
-l 



= tanh 



tanh(/3 J) tanl^/i^)] + tanh 1 [tanh(/3 J) t&nh(h k ->i)] ■ (24) 

More in general, the equation for a an Ising model built on a generic BL, regular 
or not, having generic couplings Jij and an external field Hq, reads as 

h^ = ^ tanh -1 [tanh(/3 J;,,) tanh(/3iJ + h k ^i)] , (25) 

where di stands for the set of the first neighbors of the vertex i. 

The developments about the convergence and algorithmic issues around Eqs. ( 25 ) , 
as well as their applications in physics, computer science, and statistical inference 
are huge (see e.g. [TB], [T7], [IE])- Here we wanted just to stress the equivalence 
between the Cavity/Belief-Propagation approaches and a rigorous method based on 
the Kolmogorov consistency theorem. 
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Figure 2. Graphical message-passing interpretation of the effective fields. Each 
field is in correspondence with an edge = S E (recall that our BL 

G is an u ndir ected graph). The field/message foj—j.;, emanating from i, as seen 
from Eq. ■ is a function of all the other fields/messages arriving to i: h 
and The correspondence between the fields appearing in Eqs. \l'->\ 



and 



those of Eqs. IMl is: h t 



and hi. 



h^ . Note that, in 



general, we can attribute a direction to the fields/messages h^^i even if the graph 
G is an undirected graph (i.e., for any S E we have = S E). 



5. Last observation 



From the previous Section, we see that we could re-interpret the (infinite) BL, 
as the thermodynamic limit of (finite) CT's having specific "boundary conditions" 
determined by the fields hi solution of Eqs. (20) and (21 1, as shown in [TS]. However, 
this point of view is a quite poor physical one (a system whose thermodynamic limit 
exists only for a specific boundary condition) and, pedagogically, very inconvenient. 



6. conclusions 



We have reviewed critically the concepts and the applications of CT and BL in 
statistical mechanics emphasizing their very different features. We have pointed 
out that careful must be paid especially in the case of a BL, an infinite space, 
where serious misuses and misunderstandings are currently seen in both textbooks 
and journal papers due to a ill mathematical approach to the BL. We have then 
illustrated for the BL case two alternative approaches which are rigorous and free 
of dangerous ambiguities based respectively on self-similarity and the Kolmogorov 
consistency theorem, pointing out the link of the latter with the Cavity and Belief 
Propagation methods, more known to the physics community. We hope that this 
critical review paper, where concepts and tools from physics and mathematics (too 
often kept apart) are used, might reduce the quite widespread misuse and conceptual 
errors around CT and BL. 
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